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ABSTRACT 



We estimate the degree to which the baryon density, f2&, can be determined 
from the galaxy power spectrum measured from large scale galaxy redshift 
surveys, and in particular, the Sloan Digital Sky Survey. A high baryon density 
will cause wiggles to appear in the power spectrum, which should be observable 
at the current epoch. We assume linear theory on scales > 20/i~ 1 Mpc and 
q\ • do not include the effects of redshift distortions, evolution, or biasing. With 

an optimum estimate of P(k) to k ~ 27r/(20/?r 1 Mpc), the la uncertainties 
t"*- ■ in fib are roughly 0.07 and 0.016 in flat and open (f2 = 0.3) cosmological 



models, respectively. This result suggests that it should be possible to test for 



consistency with big bang nucleosynthesis estimates of if we live in an open 
^ • universe. 

6 

Subject headings: methods: data analysis — methods: statistical — large scale 
structure of the universe — cosmology: theory 



1. Introduction 

In anticipation of the forthcoming Sloan Digital Sky Survey (hereafter SDSS; see 



http : //www, astro .princeton. edu/BBOOK ; Gunn & Weinberg 1995; Strauss 1997), several 
papers have discussed the use of large redshift surveys to determine of the galaxy power 
spectrum (Vogeley & Szalay 1996; Tegmark et al. 1997ab; Hamilton 1997ab) and parameters 
derived therefrom (Tegmark 1997; Nakamura, Matsubara, & Suto 1997; de Laix & Starkman 
1997). This paper will quantify how well we can constrain the baryon density, Qb, from the 
power spectrum of the SDSS redshift survey. 
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The universal ratio of baryonic mass density to critical mass density, Vt b , is of acute 
cosmo logical interest. In the standard big bang model, the ratio of baryon density to photon 
density, r], uniquely determines the relative abundances of deuterium, 3 He, 4 He and 7 Li. 
Although the number density of photons can be readily measured from the temperature 
and spectrum of the Cosmic Microwave Background Radiation, the measurement of baryon 
density is much more difficult. However, by determining the abundance of deuterium, for 
example, one can compute a unique value of r\ (Peebles 1993). From r\ and the measured 
photon density one can determine Qb, up to uncertainties in the Hubble constant, H . 
However, since deuterium is converted into 3 He in stars, it is necessary to look at very old 
objects in order to get an accurate estimate of the primordial deuterium abundance. Tytler, 
Fan, & Buries (1996) measured the deuterium abundances in high redshift QSO absorption 
lines and determined tt b = 0.024 ± 0.002/i" 2 , where h=H /(100 km s" 1 Mpc" 1 ). We shall 
adopt a value of h = 0.65 (Kundic et al. 1997) throughout this paper, yielding a value of 
Qb = 0.057 ± 0.005 from the Tytler et al. (1996) observations. 

Can the baryon density be independently confirmed? The baryonic mass density 
can have a profound effect on the shape of the expected mass power spectrum on large 
scales. The baryonic component of the density field is strongly coupled to the radiation 
field until z ~ 1100. There are fluctuations in the radiation power spectrum due to sound 
waves crossing the horizon at decoupling (Padmanabhan 1995; Hu & Sugiyama 1996). 
Immediately after decoupling, the baryon power spectrum is the same as that of the 
photons, containing fluctuations, or "wiggles" on large scales. If the matter field is almost 
entirely composed of Cold Dark Matter (CDM), however, the baryons quickly fall into the 
dark matter potential wells, and much of these wiggles are erased. However, the greater the 
proportion of baryons, Qb/^o, the greater the influence of the wiggles in the final (linearly 
evolving) power spectrum. 

The change in shape of the power spectrum through decoupling has been modeled 
analytically by several authors. T(k) is defined as the transfer function of the power 
spectrum such that P{k) oc k nprim T(k) 2 , where n pr i m is the spectral index of the primordial 
power spectrum. Holtzman (1989) computed T(k) for a wide variety of models, including 
some with non-zero values of Qb- Sugiyama (1995; see also White et al. 1996; Peacock & 
Dodds 1994) include a baryon correction term to the analytic fit to the transfer function: 
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(1) 



Here, q = k/hT, Q Q is the ratio of mass density to the critical density, and 
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This functional form is smooth; it does not contain the wiggles discussed above and thus 
this analytic formulation gives results which are entirely degenerate for various values of fib 
and Qo- If we take the wiggles into account, the degeneracy can be broken. 

We use a numerical approach to quantify the effect of wiggles in the power spectrum. 
We use a package called CMBfast, written by Seljak & Zaldarraiga (1996) to calculate the 
transfer function of the mass power spectrum for a wide variety of cosmological models. 

Figure [I] illustrates the differences between the analytic fit of equation ([I]) and the 
numerical approach. In Figure [IJa, we plot four power spectra, each with a value of T = 0.58, 
and a normalization on large scales based on the COBE four year results (Bunn & White 
1997). Each assumes a value of h = 0.65, and for comparison we plot both the numerical 
model and analytical fit with flo = 1, fib = 0.057, the numerical result for flo = 0.89, 
fib = 0, and the numerical result for flo = 1.37, fib = 0.26, which has the same value of 
flb/flo as that found in the open model in panel b. Note that the flf, — numerical model 
and the analytical fit are almost indistinguishable. This is because the two have identical 
values of T and neither breaks this degeneracy with wiggles. Panel b shows an analytic fit 
and a numerical model of an flo = 0.3, fib = 0.057 power spectrum. Panel c shows the ratio 
of the analytic fit to the numerical model power spectrum for the flat, fib = 0.057 case. 
Finally, panel d shows the same relationship for the open case. It is clear from panels c and 
d that the wiggles, while present in the flat universe model, have a much greater effect in 
an open universe. We propose to estimate how well fib can be determined by examining 
these features. The emphasis in this paper is not to present a method of measuring P(k), 
but to estimate errors on quantities on which P{k) depends, especially fib- 

The outline of the paper is as follows. In § [2] we discuss the use of the Fisher information 
matrix in determining parameter uncertainties, and in § ^ we present the details of our 
method. § ^ contains the results of that method for several different cosmological models. 
In § ||, we discuss other complicating factors which enter into the analysis of redshift survey 
power spectra. Finally, in § ||, we present our conclusions. In Appendix A, we present an 
alternate expansion of the SDSS density field, which may be useful for future computational 
work. 



2. How Well Can We Determine Parameters? 

Suppose that we are able to parameterize a power spectrum at the current epoch 
according to some set of parameters, ©. A moderately complete set might include the 
following parameters: = {cr 8 , fl , fib, h, n prim }, where the latter four have been defined 
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previously, and a s is a normalization constant, the rms density fluctuations within 8/i _1 Mpc 
spheres. We assume that other parameters, such as A, are known exactly throughout this 
discussion; we set A = 0. 

In addition to a model, we have some data, di,i = 1, ...,n put into a data vector, d, 
where n is the total number of data elements. Moreover, we have a method to relate the 
raw observables (e.g. positions and redshifts of galaxies) to d. In order to compare d to the 
model, we can associate some likelihood function L(@; d), which we would like to maximize 
in order to find the "true" cosmo logical parameters, 0o, where 

e -dtC-!d/2 

L = (27r)^/ 2 det(C) 1 /2 • (3) 

This form assumes that (di) = and that the expected distribution of d is a multivariate 
Gaussian. 

The covariance matrix, C, is defined as 

Clm = {d t d*J . (4) 

Following Tegmark, Taylor & Heavens (1997, hereafter TTH), we use the Fisher 
information matrix to determine the minimum possible uncertainties in these parameters. 
The Fisher matrix is defined as 

F « - (ik) ■ (5) 

where £ = — InL. This may further be reduced to (Vogeley & Szalay 1996; TTH): 

F t3 = ^Tr [AjAj] , (6) 

where the matrices Aj = C^dC/dOi. 

The Fisher matrix allows us to determine the minimum possible errors for each 
parameter in a reasonable way via the Cramer- Rao inequality (see TTH for a discussion). 
Regardless of how the actual parameters are measured, one cannot hope to do better than: 

M 2 



In the limit where all parameters but the i th are known with certainty, this becomes 
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3. Method 

In this section, we discuss how we define our data d, and determine its covariance 
matrix, C (§ j3.1| ). Our aim is to use equation (|6|) to determine the Fisher information 
matrix F for the SDSS redshift survey power spectrum, and thus put an upper limit on the 
expected error in fib via equation (0). 

The SDSS northern redshift survey will cover an elliptical cone of n steradians to an 
approximate depth of 600/i _1 Mpc (z ~ 0.2), with redshifts for approximately 10 6 galaxies. 

D. Weinberg (private communication) has created a mock SDSS redshift sample based 
on an iV-body simulation by C. Park and J. R. Gott. We have used this mock survey to 
calculate a selection function: 

^) = -E^— » (8) 

i,gal a max,i 

where uj is the solid angle of the survey and d max ^ is the maximum distance that galaxy i 
could be placed and still pass the selection criteria of the survey. The sum is carried out 
only over those galaxies for which d max ,i > r. The selection function for the SDSS is plotted 
in Figure @. We assume that the selection function has no angular dependence. 

Our primary limitations in the calculation of F involve the computation, storage, and 
inversion of the covariance matrix, C. All three of these considerations will be immensely 
simplified if we can make C as close to diagonal as possible. We reduce covariance by an 
appropriate choice for the data d (§ |3.1| ) and with a clever choice of weighting (§ p.2| ). 
For the SDSS geometry, we find that we can indeed approximate the covariance matrix as 
diagonal for scales smaller than 1/8 the largest dimension of the survey (§ |3.4| , |3.5|) . 

3.1. Computing the Covariance Matrix 

We must first specify the form of the dataset, d, that we will use. The method we use 
can be described as a "poor man's" Karhunen-Loeve (K-L) transform. The K-L transform 
(Vogeley & Szalay 1996; TTH) is a method of extracting the statistically independent 
elements of a dataset that provide the maximum ratio of signal to noise and hence most 
strongly constrain the cosmological parameters. Emulating the K-L transforms, we wish 
to minimize the values of the off-diagonal elements of the covariance matrix, since we will 
ultimately need to invert it (equation [3]). If the survey volume were a cube with a uniform 
selection function, the Fourier modes of the density field with wavelengths along each 
axis given by integral fractions of the cube length, would indeed have no covariance for 
non-identical modes; they would be the K-L modes. We thus embed our survey volume in 
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a cube of length L, which is specified by the largest dimension of the survey: 

L 2 T max sin Qmax j (9) 

where r max = 809 h^ 1 Mpc is chosen where n(r) first goes to zero, and 9 max = 1.13 rad is 
the largest off-axis angle of the survey. This gives L = 1466/i~ 1 Mpc. 

We use as our dataset d the Fourier transform of the real space density fieldQ with 

1^n x „ 2im v „ 27m z A 
k(n x , n y , n z ) = — — x + — + — — z , (10) 

where n x , n y and n z are non-negative integers. Because the SDSS geometry is of course 
not cubic, this scheme will not exactly diagonalize the covariance matrix, but we will see 



in § |3.5| that we will be able to make the approximation that it is diagonal at large \k\. 
We order the modes in concentric cubes in k space, such that they are roughly ordered in 
increasing value of |&|P|. We refer to k; as the I th value of k in this ordering. Thus there is a 
rough correspondence between increasing I and decreasing scale. As the IVinduced wiggles 
are prominent on large scales, we will determine our Fisher matrix, and corresponding 
uncertainties on parameters, as a function of n, the maximum value of I which we probe. 

This is clearly not the only basis set which will partially diagonalize the covariance 
matrix from the outset. In Appendix A we discuss a set of basis vectors constructed 
from spherical Bessel functions and the spherical harmonics which will produce very little 
covariance within the SDSS geometry. 

Following Fisher et al. (1993), we estimate the density field of galaxies as a sum of 
discrete points: 

*W = E^fe^-i. (id 

i,gal \ > 

where n(r) is the selection function, the expected density of observed galaxies at r in the 
absence of clustering. We allow ourselves a weighting function, V(r), in order to further 



reduce covariance and to minimize shot noise (see 5 3.2). The Fourier Transform of the 



density field, <5(k), is then taken. Our Fourier convention here and throughout is: 

/(k) = / d 3 r e^fiv) , (12) 



and 

/(r) = (2^p / d " k e " k ' r/ ^ k ) • ( 13 ) 



2 We discuss redshift-space distortions in § p. 4 

3 Note that unlike the K-L modes, we have not ordered the di in decreasing signal-to-noise ratio. 
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The Fourier modes of the weighted density are then: 

<5(k) = / dh e 4k - r V>(r)5(r) . (14) 



Thus d is the set of £(k), for the ordering of k/ described above. Substituting equation (|TTJ) 
gives: 

W-E^p-Wlk), (15) 
where the window function, W(k), is defined as: 

Wik) = J d 3 r ^(r)e ik r . (16) 

Taking the product of any two of these modes, we can compute the covariance: 



\T n ( r *) J \j n ( T j) 



e 



-ik.2-Tj 



-WQci) T t^}\ e -^-r t _ W ( k2 y y tpl^i-n + W ( kl \ W ( k2 y . (17) 
i "fa) i ^( r *) 

Following Fisher et al. (1993) we take the expectation value of this expression and after 
some algebra recover the expression: 

(5(k05(k 2 )*) = „ J d 3 k'p(k')w (ki - k')w(k 2 - k'y + J d*v WKT) e _ . (18) 

If the weighting function is normalized such that / d 3 r ip(r) = 1, the diagonal elements 
of the covariance matrix are: 

C u = Pfa) + P shot , (19) 

where 

P(kl) = (2^ / d3k ' P ( k ')l^( k ' " k ')| 2 > ( 2 °) 

and 



P shot = / d A r^r\ , (21) 
J n(r) 

which matches the result expected from traditional methods of power spectrum estimation 
(Fisher et al. 1993; Feldman, Kaiser, & Peacock 1994, hereafter FKP; Park et al. 1994). 
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3.2. The Optimal Choice for ip(r) 

We now discuss how the weighting function, tjj(r), is chosen. First, the weighting 
function must reflect the survey geometry. That is, it must go to zero outside the survey 
and be well-defined within it. It must also minimize shot noise by giving regions with higher 
n(r) a higher weight. Finally, we wish to minimize covariance between modes. Tegmark 
(1995) gives a clever solution which minimizes covariance in an arbitrary geometry, and 
we shall use his algorithm throughout. The weighting function is the solution to the 
Schrodinger's Equation: 

11 " V(r) = Ei/>{t) , (22) 



n{r) 

5 



where E is the smallest eigenvalue of the system, and 7 is a constant with units of length 
which balances the desire to minimize covariance while maximizing the signal to noise ratio. 
We used 7 = 10 _8 /i 5 Mpc -5 ~ n ga i L~ 5 throughout, but found similar results for values in 
the range 1(T 7 - l(T 9 /i 5 Mpc" 5 . 

Since the SDSS has an isotropic selection function within the survey footprint, 
equation ( p2|) is separable; we plot the radial and angular parts of the wave-function, ^(r) 
in Figure 0, where if>(r) = if)(r)i[)(9) . The actual footprint of the survey has a small but 
finite ellipticity, e ~ 0.1. Rather than introducing a third, azimuthal, coordinate in solving 
Schrodinger's equation, we treat the angular shape of the survey as circular, and then 
simply stretch the wave-function appropriately. 



3.3. Isolating the Independent Modes 

We have made our choice of modes in order to minimize the off-diagonal elements 
in the covariance matrix. However, we have not guaranteed that all of our modes are 
linearly independent. If the survey volume is V, then we can imagine gridding the density 
distribution of the survey into N 3 cells. For large N, we can fully describe the density field 
on scales of L/N with fy x iV 3 numbers, where: 

fv = ^- (23) 

Thus we expect only a fraction, fy < 1, of our Fourier modes to be independent on small 
scales. For the geometry of the SDSS, this fraction is given by fy = 0.176. 

Since, in general, this suggests that the covariance matrix is singular, we can use 
singular value decomposition (SVD), to create a new basis set. Examining the first n 
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Fourier modes, we determine two unitary matrices, u and v such that 

C = uCv t , (24) 

where C is a diagonal n x n matrix containing the eigenvalues, Sj, of C in decreasing order. 
Thus, by truncating at some critical value of s m /si, we may look at only the first / s = m/n 
modes. For very large values of n, it is our expectation that f s approximately approaches 
fy, since both represent a quantitative account of the amount of information per Fourier 
mode. In general, however, f s < fy at large n, because ostensibly independent modes may 
be of such low signal to noise ratio that SVD rejects them. 

We label C m as the upper m x m elements of C, as all other elements are very close 
to zero. We define the n x m matrices u m and v m as the first m column vectors of u and 
v, respectively. These matrices can be used to transform an n x n matrix into an m x m 
matrix. We can similarly transform any arbitrary matrix, D to the basis set by: 

V m = ul D v m . (25) 

By transforming dC/d6i in this way, we can readily compute the matrices Aj = C^C m ^, 
where C m ,i is the transform of dC/d9i, and (C" 1 )^ = S^ b /s a , from which follows the Fisher 
matrix. 

In order to determine the condition number s m /s\ at which f s approaches fy, we 
look at the distribution function of eigenvalues of the covariance matrix. Figure ^ shows 
this distribution for covariance matrix sizes n = 63, 342, and 999, corresponding to 
k = (3,6,9) x 2tt/L. This was done assuming P(k) = constant and ignoring shot noise, 
because we want to probe the geometry of the survey, and not a particular power spectrum 
model. The horizontal dashed line is drawn at fy = 0.176, which crosses the n = 999 line 
at s m /si = 0.004. For n > 342, the distribution function asymptotes to a uniform curve, 
and therefore we can choose the condition number to be 0.004 independent of our value of 
n, and match the requirement f s ~ fy. 



3.4. The Size of the Covariance Matrix 

Calculating the Fisher matrix requires performing an SVD on the covariance matrix. 
If we make no simplifying assumptions, a grid of N different values of k requires an N x N 
matrix. Performing the SVD requires ~ iV 2 calculations, and transforming a matrix to the 
basis set requires ~ iV 3 calculations, not to mention the fact that each element in the matrix 
is a three dimensional integral over a non-symmetric function (equation |I^)! For a survey of 
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the size of the SDSS (~ 10 6 particles), it is reasonable to have as many independent modes, 
and using the method outlined above, vastly overwhelm the limits of both computing time 
and of storage. 

However, our task may be simplified somewhat. We have reason to suspect that 
that after some critical mode, n crit , we may approximate the covariance matrix as being 



approximately diagonal, since at small scales, the convolution in equation ( |20| ) becomes 
more and more an unbiased estimator of P(k), as the edge effects become less and less 
important. 

Figure [| represents the increasing sparseness of the covariance matrix pictorially, where 
we assume the SDSS geometry and selection function, again using the simplified model of a 
constant power spectrum with no shot noise. In Figure |]a, we show the first 10000 modes of 
the covariance matrix (selecting each 100 th mode). Figure |]b shows the covariance matrix 
for the first 100 terms, and Figure ^c shows the 1000 th to 1100 th terms. In each case, the 
shading is linearly proportional to the covariance between the modes. The off-diagonal 
terms become smaller for the higher modes as the edge effects become less important. We 
should therefore be able to truncate analysis of the full covariance matrix at some point, as 
we discuss in the following subsection. 

However, we cannot simply consider the covariance matrix to be completely diagonal 



beyond some n crit , given the argument in § 3.3 about the overcounting of independent 



modes. We therefore consider a likelihood function for the ridiag modes from n cr u + 1 to 
n cr it + ndiag, using the diagonal covariance matrix approximation: 

+ Y n diag ln(27r) . (26) 
From this, the diagonal estimate Fisher matrix elements are: 




pdiag _ fv_ °"L^* a9 dC nn / 89 j dC nn j 89 j 



E ^/-.^/^ , (27) 

Z n=n crit +l U 1i U »i 



Thus, we will treat the covariance matrix in two parts: a square matrix Q s i uare with 
k < k ncrit , and a diagonal matrix Q dta 9 with k > k ncrit . Simple algebra demonstrates 
F = Y square + F d%a9 . We now turn to the determination of an appropriate value for n cr it- 



3.5. At What Point May We Use the Diagonal Covariance Matrix? 



In order to determine at what n crit we can start treating the covariance matrix as 
diagonal, we have computed both the full and diagonal covariance matrices for a power 
spectrum model with Q — 1, fij, = 0.057, and og = 1.6, with the effects of shot noise 
included. We performed an SVD on the full covariance matrix using s m /s\ = 0.004 
determined above. We then computed the Fisher matrix element corresponding to the 
normalization of the power spectrum for both the full and diagonal estimates, as a function 
of maximum mode number, n. 

In order to compare these two estimates, let us define the Fisher function Fu(n) as 
the value of Fa computed for the first n terms of the covariance matrix. To compare Fa 
computed using the full and diagonal covariance matrices, we introduce an improvement 
function: 

Tin) = % ,, V ) . (28) 
1 ' Ft l \n) 1 ^ 

It is our hope, of course, that for some value of n, the improvement function tends to 
unity, after which we can safely apply only diagonal terms without a significant loss of 
information. 

In Figure |5|, we present the results of this exercise. Figure [|a shows the improvement 
function as a function of mode number, while Figure [|b shows the behavior of the Fisher 
function, F(n), for the full matrix (solid) and the diagonal elements only (dashed). The 
two curves approach one another at n = 728. 

We thus adopt n crit = 728, and treat the covariance matrix as diagonal for larger /. In 
summary, we have several reasons to argue that we can do this: 

1. The covariance matrix becomes increasingly sparse at higher values of \k\ (Figure J|); 

2. The Fisher information matrix using the diagonal and the full covariance matrix gives 
similar results at n = 728 (Figure 0); 

3. By n = 728, the distribution function of eigenvalues (Figure and therefore the 
condition number required to make f s ~ /y, asymptotes to a constant value. 



4. Results 



-12- 



4.1. Expected fif, Uncertainties for Various Cosmologies 

In this section, we compute the uncertainties of cosmological parameters used in a 
typical model power spectrum using the Fisher information matrix. We will present 3 
models, each with h = 0.65, n pr i m = 1, and A = 0. Model 1 is a flat universe (Qq = 1) 
with a baryonic component consistent with nucleosynthesis estimates = 0.024/i -2 ) and 
a value of a$ = 1.6. Model 2 is a flat universe with no baryonic component and = 1.6. 
Finally, Model 3 is an open universe, f2o = 0.3, with Qb = 0.024/i -2 and erg = 0.64. All three 
models are consistent with COBE normalization for their respective cosmologies (Bunn & 
White 1997 and references therein). The power spectra of Models 1 and 3 were shown in 
Figure [I]. 

We also include a Bayesian prior Fisher matrix, which takes into account the fact that 
we do have prior information about h, n pr i m , and Qq. We assume that these quantities have 
Gaussian-distributed, uncorrelated (and quite conservative) errors AQ = 3, Ah = 0.5, and 
Ariprim = 1. The prior log-likelihood function is: 

£prfar(0) = 53 ( (g '~^° )2 + MA^o)) + N ln(27r) . (29) 



The contribution to the Fisher matrix is 

ij 1 —* u i,0 u ij ' 

where Ftf al = F? DSS + Ff" m . 

% J % J b J 



F!"° r = AO* , (30) 



Note that as we add data, the measured uncertainties will quickly dominate, and hence 
our final error estimates will not be strongly linked to our Bayesian priors. 

Figure || shows A9i = {F~ l )\( 2 , the computed parameter uncertainties for each of these 
models. If we include all the Fourier modes down to a physical scale of 20/i _1 Mpc (where 
the evolution of the power spectrum is expected to become non- linear), we find that the 
uncertainty in Q b is 0.070, 0.060, and 0.016 for the three models, respectively. Models 1 
and 3 have a value of Qb = 0.057, which means that, at the very least, we could check 
for inconsistency with Qj, — at the 0.8 and 3.5 a level, respectively. This suggests that 
studying redshift surveys in an open universe will give us a much better handle on than 
will a closed universe, as the size of the wiggles depends on Qb/^o (Figure [I]). 

Note that Ads varies by a factor of ~ 2 from the open to closed models, roughly the 
ratio of the values of as used in each model. This gives a fractional uncertainty which is 
roughly constant between models. Likewise, we estimate similar fractional uncertainties in 
Qq for all three models. 
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Finally, it worth noting that there is no discontinuity in the behavior of any of the 
estimated parameter uncertainties at n = 728 modes, where we start treating C as diagonal. 
This reassures us that the transition to a diagonal covariance matrix does not introduce 
any gross feature in the Fisher matrix. 



4.2. A Different Set of Parameters 

The parameters used in the parameter estimation, ©, are not independent of each 
other; for example, Qq, h, and Qf, are coupled through T (equation |2|). This is reflected in 
the off-diagonal elements of the Fisher matrix, which is large for pairs that are coupled. 
Thus the covariance between Qq and Qb, 

fJ'ij = 7 71/2 ' (31) 



33 



is 30% on small scales, and appreciably larger on larger scales. We thus are motivated to 
use a set of parameters that are closer to orthogonal, to minimize their uncertainty. In 
particular, the parameter Y sets the overall shape of the power spectrum, and the amplitude 
of the wiggles goes roughly as Qb/^o- Therefore, we now consider the parameter set, 
T = {as, T, Qb/Qo, h, n pr i m }, since these quantities should be closer to independent. 

We use the chain rule to convert between one data set and another: 



ab 



92 C \ - V F "* " j (12) 



0T a dT h lJ dT a dT t 



l :J 



Figure [7] plots the results of the parameter rotation. The uncertainties in T are about 
three times larger in the flat universe models than the open universe model, corresponding 
to very similar fractional errors in each model. The uncertainties of the parameter Qb/^o 
behave consistently with the results determined in the previous section. That is, all three 
models produce roughly the same uncertainty in Qb/^o, about 0.06, corresponding to 
the values of Qb/^o inconsistent with zero at the ~ la level in Model 1, and the ~ 3a 
level in Model 3. Note that models 1 and 2 have sharp downturns in Ar at around the 
threshold of the nonlinear regime. This is due primarily to the fact that the wiggles 
appear most prominently in flat cosmologies at around those scales (see Figure p. The 
covariance between flb/^o and T is only 15% in this regime, and thus both parameters can 
be determined with greater accuracy. 
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4.3. A Comparison with Band Estimates of the Power Spectrum 

We check for consistency of the above method with another, perhaps more intuitive 
estimate of the power spectrum. In this section, we divide /c-space into bands, and 
determine the errors of P(k) in each band. From there, we can compute the uncertainties 
in the power spectrum parameters using the Fisher formalism. 

FKP give an estimate for the uncertainty in P(k) for an approximately isotropic survey. 
Though this is not strictly true in our case, this approach is merely to serve as an order of 
magnitude consistency check on the final errors. Moreover, on smaller scales, isotropy is a 
fairly good assumption. The expected error for each band is: 



(AP(k)) ( (2vr) 3 / d 3 r t^{t)^(t) 



i - * 1/2 



n(r)P(k) 



P(k) I 14[/d 3 rn 2 (r)^(r)]' 



(33) 



where is the volume of the shell in fc-space; we will use the weight function ip shown in 
Figure |2|. If we take spherical shells, the volume of the i th shell is T4 = ~ hf-i)- We 

set the bands to be spaced at increments of k = 2ir/L, where L is the characteristic length 



defined in § |3.1| , as the modes should be approximately independent with this spacing. The 
FKP model ignores covariance between modes, and we will use this approximation here as 
well. 

The Model 1 power spectrum (f2 = 1, f)& = 0.057) and the resulting errors are plotted 
in Figure 

The log-likelihood function for some dataset of band-averaged power spectrum modes 
can be determined using a standard \ 2 expression because we have neglected covariance: 

£Mp 4?(w !+in H + ^ in(2 ^' (34) 



Here, Pi is the value of P(ki) computed using a model, Pi is the measured value of the 
power spectrum within the I th band, and APi is the associated error computed above. The 
Fisher information matrix follows straightforwardly (see also Tegmark 1997): 

_ sr dP i dPi 1 

^-^rnlW.JKpJ^ (35) 

Note that is insensitive to bin size; if we were to take smaller bins, we would have 
more terms contributing to the sum in equation ([35|), each with smaller and therefore 
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larger AP (equation [3^). In Figure |l we present the comparison of the final errors on flf,, 
<7g, and Qq determined in this way with those determined using the full covariance matrix, 
as a function of the maximum value of \k\ used in the analysis. The two are in fairly good 
agreement, but in general, the band estimate produces smaller estimated uncertainties 
than does the covariance matrix analysis. This is not surprising, since the band estimates 
assume that each band can be measured independently from one another, and thus, it 
neglects covariance between different bands. This serves to underestimate the true error. 
Furthermore, the FKP approximation uses the assumption of isotropy and weighs all of the 
modes within a band equally (see Tegmark 1995). Finally, in the full covariance matrix 
analysis, covariance between individual modes within a given band can give information 
about P{k). However, both the FKP approximation and the diagonal covariance matrix 
analysis ignore this. 



5. Other Complicating Factors 

We have assumed that the observed galaxy field reflects the mass density field on the 
length scales that we probe. Furthermore, we assumed that the redshifts measured represent 
the actual distances of the galaxies, and thus we have an accurate, three-dimensional map 
of the density field. We also assumed that the clustering of our density field behaves in the 
same way throughout the volume of the survey. Finally, we assumed that linear theory 
describes the evolution of the power spectrum perfectly down to some scale. We briefly 
address the validity of these assumptions, and what may be done to compensate for them 
when they do not hold. A more comprehensive review of some of these issues appears in 
Strauss & Willick (1995). 



5.1. Evolution of Clustering 

The SDSS survey will extend to a redshift of z ~ 0.2. As a result, we would expect that 
local structure will have a clustering amplitude which is larger than at the furthest reaches 
of the survey due to the growth of structure with time. Let A(z) be the ratio of power at 
redshift, z, to redshift 0; A(z) = 1/(1 + z) 2 for Qq = 1, in linear theory. We can correct the 
density perturbations at any given redshift, z, to account for the cluster evolution that will 
take place between z and z = 0: 

5{k) = J d 3 r e ik r A(^)- 1/2 ^(r)5(r) , (36) 

and where z and r are related through the standard cosmological equations. 
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Following this through, we find that the window function is redefined as: 

W(k) = J d 3 r e ik 'M(z)- 1 / 2 V'(r) . (37) 

Using the revised definition of the window function, the covariance matrix becomes 

C lm = = J d'k'Pik^Wih - k')W(k m - k'Y + / d'rAiz)- 1 ^ 2 ^ . 

(38) 

A(z) depends on Qq and to a lesser degree A. Therefore, including this effect breaks the 
degeneracy between Q and h, although further experiments need to be done to quantify 
how large an effect this actually is. 

5.2. Redshift Space Distortions 

Throughout this paper, we have assumed that we know the full three-dimensional 
position of each galaxy. However, in a real survey, we do not know the distance of each 
galaxy, only its redshift, which differ due to peculiar velocities. On small scales, the redshift 
distribution differs from the real space distribution via the "fingers of God" , the elongated 
structures seen along the line of sight due to motions of galaxies in a virialized system. On 
larger scales, gravitational collapse causes structure to appear compressed along the line of 
sight. 

If our galaxy survey were confined to a sufficiently small angle on the sky, we would 
be able to use a fairly simple approximation in linear theory to account for these effects 
(Kaiser 1987; Cole, Fisher, & Weinberg 1994): 

5(k) s = 5(k) fl (l + /V) , (39) 

where 5g and <5r are the Fourier transforms of the redshift and real density perturbations, 
respectively, (5 = f(Q )/b, where b is the biasing parameter, and /(fio) is the logarithmic 
derivative of the fluctuation growth rate, and the cosine of the angle between k and the line 
of sight is pi. 

The more general case is discussed by Zaroubi & Hoffman (1996). Using linear 
perturbation theory, they find the general relation between the redshifted and real Fourier 
modes. They show that different values of k become coupled due to redshift distortions. 
Since we actually "measure" <5(k) s , but the real density perturbations are given by S(k) R , 
one needs to compute the covariance matrix of the redshift space modes in terms of the 
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real space perturbations and their redshift distortions. This of course introduces a new- 
parameter, /3, to be included in the Fisher matrix. The expressions of Zaroubi & Hoffman 
(1996) are quite complicated, and introduce further coupling between modes, which may 
affect the applicability of the techniques used in this paper. In the Appendix, we discuss 
an alternate basis set that has the potential to reduce this problem for redshift space 
distortions. 

Finally, Nakamura et al. (1997) and de Laix & Starkman (1997) discuss the effects 
of space curvature on the redshift space distortions in the SDSS galaxy survey, and show 
that these effects need to be taken into account for a proper treatment of redshift space 
distortions. 



5.3. Biasing and Selection Effects 

We have further ignored the fact that we are dealing with the galaxy density field, 
rather than the unbiased matter density field. Here we mention several effects that biasing 
may give. 

First, we have assumed that a galaxy density fluctuation of a certain scale, 5 g (r) is 
equal to the density fluctuation of matter, 5 dm (r)- Indeed, the two may be related by some 
constant of proportionality, b, such that S ga i(r) = b 8dm{t)- The normalization parameter, 
cr 8 , which is proportional to P(k) 1 ^ 2 , is also proportional to b, and thus is not a major 
concern. 

A more serious issue comes from the fact that biasing is not necessarily linear, nor is 
it deterministic. The bias can be a strong function of scale and can have finite scatter (cf., 
Cen & Ostriker 1992), and hence, the normalization (and the other relevant parameters) 
are no longer corrected in a non-trivial way. 

We have further made the assumption that the bias is independent of galaxy luminosity. 
Since the galaxies at high redshift are at the bright end of the luminosity distribution in a 
flux-limited sample, the measured galaxy power spectrum at high z will be different from 
that found locally if this assumption breaks down. We could avoid this problem by using a 
volume-limited survey. However, this would increase the shot noise; further calculations are 
needed of the Fisher matrix in this case to see how this affects A£l b . 

Finally, we have made the assumption that there is no substantial evolution in galaxy 
populations since z ~ 0.2. This further complicates matters in that the bias parameter can 
be a function of z (e.g., Fry 1996) as well as local density. 
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These objections notwithstanding, future improvement in our understanding of 
evolution, biasing, and the universal luminosity function can certainly be incorporated 
into the Fisher formalism. Once these effects are included in the covariance matrix, the 
error estimation proceeds exactly as we have seen, with the possible addition of further 
parameters which describe these effects. 

5.4. Nonlinear Effects 

We have treated the evolution of the power spectrum as well described by linear theory 
down to a scale of 20/i~ 1 Mpc, and assumed that everything below that scale was nonlinear. 
However, this may not be a concrete limit. Both analytic (Taylor & Hamilton 1996; Jain & 
Bertschinger 1994) and numerical (Baugh & Efstathiou 1994) analyses suggest that for some 
models, the linear regime may extend to even smaller scales. This produces two competing 
effects. First, nonlinearity tends to smooth out features in the power spectrum, reducing 
the size of the wiggles, and thus, limiting the information about fib- However, if we have a 
good model of the power spectrum which does extend well into the non-linear regime, we 
can also extend our covariance matrix to smaller scales. Though we expect features like the 
wiggles to be smoothed out (cf., Tegmark 1997), we nevertheless may expect to get more 
information on T, and with that parameter more tightly constrained, our uncertainties in 
fib decrease as well due to the covariance between them. 

6. Conclusions 

We have discussed the possibility of measuring fl b from the power spectrum in the 
forthcoming SDSS. In order to simplify matters, we have assumed uniform biasing, and 
that there are no redshift space distortions and no evolution of either the density fields or 
of the constituent galaxies. Our approach is to use the Fisher information matrix, which 
uses the covariance matrix of the data to put minimum error bars on derived parameters 
via the Cramer-Rao inequality. 

We decompose the density field of a galaxy redshift survey into discrete Fourier modes 
in units of 27r/L, where the characteristic length, L = 1466/i -1 Mpc, is given by the survey 
geometry. The data are placed in order of decreasing physical scale. This ordering is ideal 
in the sense that a given mode number is associated with a physical scale, and the signature 
of a high baryon density (the wiggles) are scale dependent. 

Computational limits do not allow us to consider the full covariance matrix for more 
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than ~ 1000 Fourier modes, corresponding to k = 9 x 2 ir/L. We show, however, that on 
these scales, the covariance matrix becomes accurately diagonal, and we use the diagonal 
approximation for n > 728 modes. We take care to account for the correct number of 
independent modes, using singular value decomposition for n < 728, and a calculation of 
the effective volume of our sample for larger n. Calculations are continued to physical 
scales of 20/i~ 1 Mpc, where it is expected that the evolution of the power spectrum becomes 
significantly nonlinear. 

We have applied this method to several different models, including a flat (Qq — 1), 
and an open (Qq = 0.3) model, each with = 0.024 ± 0.002/i -2 , and a flat model with 
no baryons. In the models with baryons, we found that the uncertainties were inconsistent 
with fi 6 = at the 0.8a level in a flat cosmology, and at the 3.5a level in an open cosmology. 
The quantities fib/fio, which describes the amplitude of the wiggles, and T, which describes 
the overall shape of the power spectrum, should be close to orthogonal in the fits. We 
found that Qb/^o is inconsistent with fib = at the la and 3a in the flat and open models, 
respectively. Finally, we compared this result with the estimate of uncertainties which can 
be derived from band estimates of the power spectrum and found qualitatively consistent 
results, all of which lends great confidence to the idea that in an open universe, may be 
independently measured from galaxy redshift surveys. 

We have ignored in this paper a number of physical effects that need to be understood 
before we can claim a definitive calculation of the power spectrum and its errors. We 
plan to use numerical simulations to include the effects of nonlinearities; if these can be 
properly modeled, it is possible that errors on parameters can be tightened considerably. 
Distortions to the density field due to galaxy peculiar velocities (redshift-space distortions) 
are known to be an important effect; including them would complicate the analysis 
considerably, although the basis set described in the Appendix may simplify the problem. 
The effects of space curvature and the evolution of clustering are probably less important, 
but are straightforward to include. Finally, scale- and luminosity-dependent galaxy biasing 
are additional complications which are difficult to model realistically, but it should be 
straightforward to give rough estimates for how important these effects are on the estimation 
of quantities such as fi&. 

The authors would like to gratefully acknowledge David Spergel, Michael Vogeley, Max 
Tegmark and Michael Blanton for invaluable discussions. We thank David Weinberg for the 
use of his mock catalog. M.A.S. acknowledges support from the Alfred P. Sloan Foundation. 
D.M.G. is supported by an NSF Graduate Research Fellowship. 
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A. An Alternate Basis Set 

Throughout the paper, we have used the Fourier modes of the density field as our basis 
set. However, we could imagine other basis sets that take the SDSS geometry better into 
account. D. Spergel (private communication) suggests using a form of spherical harmonics 
and spherical Bessel functions for the basis set, with basis vectors given by: 



where 

yim 



eimn = yim{0A)ji{k n r) , (Al) 

H+l<'-™>'lV(f5£)^*. (A2) 

V cos 6> n ) v 



4tt (l + m)\ 

Here, P™ are the Legendre polynomials, are a set of functions related to the spherical 
harmonics, ji(x) are the spherical Bessel functions, k n is the n th value of k satisfying 
ji(k n R) = at the edge of the survey, and 9 is the radial angle of the circular cone which 
defines our space. 

It is straightforward to show that this forms a complete, orthonormal basis set within 
a circular cone of opening half-angle 9q, embedded in a sphere of radius R. The SDSS 
geometry is fairly similar. Therefore, we might imagine that using such a basis set will give 
us a very nearly diagonal covariance matrix. The resolution scale of these modes are related 
to n max and l max , the maximum values of n and /, respectively (see Fisher et al. 1995 for a 
discussion). 

We can first decompose the density field into modes of this basis set: 

Imax I T^max 

6 ( r ) = E E E Bj lmnJl {k n r)y lm (0A) , (A3) 

;=0 m=—l n=l 

where Bi n is a normalization constant. The transform conjugate is given by 

5 lmn = J d 3 r 3l (k n r)yU0,mr) . (A4) 

Without loss of generality, we can add a separable weighting function 
■0(f) = ifj(r)ifj(6,<j)). This is included analogously to the weighting function used 
with the Fourier modes, and indeed, the method suggested by Tegmark (1995) is separable 
in radial and angular parts, and thus may provide the best solution to limit covariance. 



is: 



Following Fisher et al. (1993), the expectation value of the product of any two modes 
(5imJ* Vm >n>) = J d3r i J d 3 r 2 £(r 1 -r 2 )ji(k n r 1 )j l ,(k n ,r 2 )y; m (9 1 ,(f) 1 )y l , m ,(92,(i) 2 ) 
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+ / ^ jiMMkn.rwuo, m^Ao, t>Krfm 0) 2 (A5) 



>A 

n(r) 

We will label the latter term Cf^°*/ m / n /. 

Following Fisher et al. (1995), we use the Rayleigh expansion to find: 

Air 

e ,k. r = Y,i l ji{kr cos9 )yt m (9, ( f>)y l m(Ok,<Pk) , (A6) 

COS (7n , 

u l,m 

where 9k and fc are the spherical coordinates of k. 

Using this, and the definition of the autocorrelation function, we see: 

£(ri - r 2 ) = — - yi"m"{9 1 ,4> 1 )i;{9 1 ,4> 1 )y lllmll {9 2 , <h)^{0 2 , fa) 

71 cos u o V , m u 

J k 2 dkP(k)ji> l (kricos9 )?p(r 1 )ji> / (kr2Cos9o)?p(r2) . (A7) 

Plugging this in, we get: 

2 f 2 

Clmnl'm'ri = 7T ^-Iml" m" m ' I" m" \ ^kk P(k) 

71 COS tf j// m // J 

y drirlj^knTijji^kri cos6» )^(n) 

1 dr2rlj v {k nl r2)jv,{kr2 cos 9 )^(r 2 ) + C^ Vm , n , , (A8) 

where we have defined: 

A lmVm , = J duyt m (9, <j>)y Vm ,{6, (f>)4>(9, 0) , (A9) 

In the case where we have perfect sky coverage over a circle of radius 9 and uniform 
ip(6,<f>), the A coefficients go to Ai mVm i = Sf[,S^ m , : which simplifies the form substantially. 
Indeed, since there is only covariance between different values of n, the covariance matrix 
becomes block diagonal. Each block can be constructed as a covariance matrix of constant 
I and m and varying n. This would simplify storage and inversion tremendously, allowing 
us to use all the observed data. 

Perhaps even more importantly, redshift distortions also only couple different values of 
n (Fisher et al. 1994, 1995; Heavens & Taylor 1995), and so this formalism should allow 
these distortions to be modeled without a great deal of additional complications. 
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There are difficulties, however. First, the survey is not perfectly circular, and hence 
there is some covariance between different values of I and m. Moreover, there is a double 
sum and a triple integral in this form of the covariance matrix (compared with only a triple 
integral in the Fourier picture). Moreover, the A coefficients, though not model specific, 
still need to be computed for a given geometry, and stored in a four dimensional array. 
Finally, computing spherical Bessel functions and spherical harmonics are time consuming. 

Still, this basis set offers the possibility of an even better approximation of 
diagonalizability than the Fourier modes, and may be useful for future consideration. 
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Fig. 1. — (a) Comparison of 4 power spectra, each with h = 0.65, n pr i m = 1, and T = 0.58. 
The solid line is the numerical result from CMBfast with Q = 1.0, = 0.057. The short- 
dashed line is the analytic power spectrum from equation [I] using the same parameters. 
The dotted line is the numerical result for Qq = 0.89, Qb = 0. The long-dashed line is the 
numerical result for fl Q = 1.37, Qb = 0.26, corresponding to the same ratio of flb/^o as in 
the open model in (b). (b) A comparison of the numerical (solid) and analytical (dashed) 
model power spectra for an open cosmological model, with Q = 0.3, Qb = 0.057, T = 0.173. 
(c) The ratio of the analytic to numerical power spectra for Qq = 1 and Qb = 0.057. (d) The 
ratio of the analytic to numerical power spectra for fl = 0.3 and fib = 0.057. 
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Fig. 2. — (a) The selection function (solid), and the radial weight ip(r) used in weighting the 
galaxy catalog, (b) The angular behavior of the weight, ip(6), where 9 is the angle from the 
central axis of the survey cone. Since the radial and angular parts of the weighting function 
are completely separable, the normalization of each component is arbitrary. 
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Fig. 3. — A plot of the fraction of eigenvalues s that are greater than some fractional value, 
for several different maximum numbers of modes. The dotted line is 63 modes, the dashed 
line is 342 modes, and the solid line is 999 modes. The horizontal line represents fy = 0.176. 
As n increases, the curves asymptote to a uniform form, and that at n — 999 modes, a value 
of s m /si = 0.004 gives approximately f s = fy. 
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Fig. 4. — A grey-scale map of the weighting used in the covariance matrices, ordered in 
increasing value of \k\. The weighting goes as: / d 3 k'W(k' — ki)W(k' — k 2 )*. (a) 1/100 of 
the first 10 4 modes, (b) The first 100 modes. (c)The 1000 - 1100 th modes. Note that the 
matrix gets sparser and sparser as we look at higher and higher modes. 
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Fig. 5. — (a) The improvement function, Z(n), for the normalization of the power spectrum 
as a function of the number of modes, n. (b) The Fisher function, F(n). The solid line 
is computed using the full covariance matrix, while the dashed line is computed using the 
diagonal elements only. The above analysis goes out to k = 8 x 2n/L, which corresponds to 
the first 728 modes. 




Fig. 6. — The computed uncertainties for (a) as, (b) Qq, and (c) Qb for various cosmo logical 
models. The solid line is Model 1, the flat, f2& = 0.057 model. The dotted line is Model 2, the 
flat Slf, = model. The long dashed line is Model 3, the Q = 0.3, Qb = 0.057 model. The 
short-dashed vertical line is at the mode corresponding to A = 20/i~ 1 Mpc, the approximate 
limit of the linear regimes. The horizontal dashed lines are the actual values used in each of 
the models. 
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Fig. 7. — The computed uncertainties of (a) T and (b) Q b /Q . The solid line is Model 1, the 
dotted line is Model 2, and the long dashed line is Model 3. 
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Fig. 8. — The associated errors on the Model 1 (Q = 1, £l b = 0.057) power spectrum and 
its parameters, (a) The errors on the banded power spectrum using the FKP error bars, 
(b) The uncertainty on Qf, using FKP errors (solid) compared with the uncertainty of Q b 
computed using the covariance matrix (dashed), as a function of \k\. (c) The errors for a 8 . 
(d) The errors for Q . 



